Perceptual Domain Based Speech and Audio Coder
نویسندگان
چکیده
This paper applies a new auditory filterbank to wide band speech and audio coding. The coding algorithm is capable of producing high quality coded speech and audio, which account for temporal as well as spectral details. The analysis and synthesis are performed using a critical-bandrate auditory filterbank with superior auditory masking properties. The outputs of the analysis filters are processed to obtain a series of pulse trains that represent neural firing. Post and pre temporal masking models are applied to reduce the number of pulses in order to produce a compact timefrequency parameterization. The pulse amplitudes and positions are then coded using run-length coding algorithm.
منابع مشابه
Gaussian Mixture Model Based Coding of Speech and Audio
The transmission of speech and audio over communication channels has always required speech and audio coders with reasonable search and computational complexity and good performance relative to the corresponding distortion measure. This work introduces a coding scheme which works in a perceptual auditory domain. The input high dimensional frames of audio and speech are transformed to power spec...
متن کاملA Perceptually Based Embedded Subband Speech Coder - Speech and Audio Processing, IEEE Transactions on
A new scheme for robust, high-quality, embedded speech coding based on subband decomposition and perceptually optimized bit allocation and prioritization is presented. An infinte impulse response (IIR) quadrature mirror filterbank (QMF) performs subband decomposition. A perceptual model, computed using subband spectral analysis, optimizes the coder’s perceptual quality. Dynamic bit allocation a...
متن کاملNarrowband perceptual audio coding: enhancements for speech
This paper presents a bi-modal coding paradigm to compress narrowband audio signals at 8 kbit/s. In the general mode, the Enhanced Narrowband Audio Coder (ENPAC) exploits the characteristics of the human hearing system to adaptively code the perceptually important spectral components of the input audio. The other mode is employed to handle audio inputs with a strong harmonic structure. In that ...
متن کاملPerceptual Coding of Narrowband Audio Signals
New applications such as Internet broadcast and communications, consumer multimedia products, digital AM broadcast and satellite networks are emerging. Those applications require moderate audio quality without annoying artifacts at bit rates below 16 kbit/s. Although speech coders provide high speech quality at bit rates around 8 kbit/s, they perform poorly when encoding audio signals. In this ...
متن کاملJoint filterbanks for echo cancellation and audio coding
In this paper, joint structures for audio coding and echo cancellation are investigated, utilizing standard audio coders. Two types of audio coders are considered, coders based on cosine modulated filterbanks and coders based on the modified discrete cosine transform (MDCT). For the first coder type, two methods for combining such a coder with a subband echo canceler are proposed. The two metho...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001